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Dominique Perrin 
L.I.T.Pw Universite Paris 7 

INTRODUCTION 

Many interesting combinatorial problems on words deal wit* 
rearrangements of words. One of tne goals of such rearrangements is to provide 
toijective mappings between sets of words satisfying certain properties and 
therefore give some enumeration results on words. The interested reader may 
consult the Chapter try l>» roata in Lothaire's book £11} where some exaimsles of 
rearrangements are developped. The algorithms involved in such rearrangements 
are. by the way. close to the more popular ones einee many sorting problems 
can' usefully be formulated in terms of rearrangements. The study of 
rearrangements has lead D. roata to consider words over an alphabet in which 
some of the letters are allowed to commute* And this, in turn, could have 
raised the interest for studying 44 in abetracto" problems concerning these 
words and the structure of the commutation monoids which is their habitat. 
Nonetheless, it happened on the contrary that words on partially commutative 
alphabets became of interest to computer scientists studying problems of 
concurrency control. Koughly speaking, the alphabet considered in this 
framework is made of functions and the commutation between these functions 
corresponds to the commutation of mappings under composition. A typical 
problem is then to decide wether, up to the commutation rule, a given word is 
equivalent to one in a special form (see Cl3l, chapter 10 for an exposition of 
this problem). My own interest in such questions was motivated by the work of 
M,P, Pis and g. Roucairol C9 3 who proved a surprising result on finite 
automata in commutation monoids motivated by problems of concurrency control, 
The aim of this paper is to present a survey of results obtained recently on 
commutation monoids including a generalisation of the above mentionned, Xt 
does not intend to be a comprehensive exposition and many facets of the 
question have been left in the datfc< The first section introduces some 
terminology and definitions. The second section contains the discussion of two 
normal form theorems in commutation monoids, The third section contains some 
results on the structure of commutation monoids. Finally* in the last section, 
I will discuss the problem of finite automata and commutation monoids, 

1;' TREE •PPggrXRIiEfY COMHlTT&TIVE'WITOIPg - — • - ■ 

we shall consider a finite alphabet A and a binary relation © on 
the set A . We suppose that G is symmetric. For two letters a, 6 « A, we 
denote t 

= oa 

whenever the pair (a,b) is in relation by e. we consider the congruence of 
A* generated by this relation and we denote 

u m v mod. 9 

if the two words u.v are congruent. This means that there exists words 
w 0 . w 1r . . . r w k with w Q = u,w H = v and for each / (0*i*H-l > w f - rj*os /r 
w l+1 a r^asf with (a,b> « ©. 



NATO AS] Sori«u. V&LF12 
Combinatorial Algorithms on Words 
Edited by A Apostolico and Z. GaJil 
© SpiiftgftM/itrfeg Berth) Heidelbos 1985 
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we donate M(A,&) the quotient of A* by the congruence =. It is 
called the tree partially commutative monoid generated by A with respect to 
the relation e. Such a monoid will also be called a commutation monoid. 

- A simple example of a commutation monoid A* obtained by considering a. 

direct product 



M - A* 1 x A*2 *• • ■* A* ft 



of free monoids. 

A useful characterisation of equivalent words, proved in CGI is the 
following . 

we denote by lvl fl the number of occurrences of the letter a in 
the word u. Fox a subset 3 of A, we denote by tt s <i/) the projection of 
u on 8* Which is obtained by erasing all letters which are not in 8 . 

proposition 1.1 . One has 

v «= v mod. e 

iff 

<i) for each a e A t \u\ a ivi fl 

(ii) tor each pair a,b of tatters such that (a,b> £ © one has 



rr a t b <ti) K n a,b 



(v) 



Proof ! "The conditions are obviously necessary, conversely, we can use an 
induction on the common length of u,v. The property is clear if U,Y are 
letters. For the induction step, let u - au', v = bv 4 with a,b e A and 
tr p / « A*, if s * b, we are done by the induction hypothesis. We suppose 



Since I Lf I 



a 



we have 
or. 



\v\ B > 1 . Let V » rsfi 
if we had (a, c> 4 O, 



with 
then 



irl^ * 0, fce* 
(v) 



c be any letter occurring in or, n we naa ta.cj * unen w a,c 
would begin by a and ff a ,c (v ^ would begin by e, a contradiction. 
.Therefore ...or*-*, abr*. Now- the. pair ... *i% b/tft . satisfies the conditions (O.and., 
< ii ) and therefore the two words are equivalent by the induction hypothesis . 
Hence 



u * aw = abrs = bras = v. o 

There is a representation of the elements of the monoid 
labeled directed graphs which is obtained as follows. 



M (A,0) by 



We shall associate to each word u e A* a graph 0<u) called its 
dependency graph. Let u = a^. . *7r witti ay e A. The vertices of G are 
. ,n and integer f is labeled by the letter a/. There 
iff 



the integers 1,2, 
is an edge (IJ) 



(i) 
(ii) 



/ < I 

ca h ap * e 



or d/ = aj 
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(iii) fox each H with I < k < f ve haw S/ * a* 

The number of edges An G(u) is at most *n where k = CsrtfW . In 

fact, for each /, the number of edges f/,/) is at most * since a; * a* 

when </,>>-- and *(*,/> are edgea. 

PROPOSITION 1.2 . One G<i» - G(v) Iff u = v. 

Proof - If ti v then certainly Q(u) = Gfv> since this implication is 

obvious for u = rabs, v = rbas with fa,o> « e. Conversely, we uee 

Proposition 1.1. condition (1) is trivially satisfied. Condition (ii) is alao 

satisfied since there is an edge between any two consecutive positions of each ; 

t** 3 *a,b <u) ' n 9,b< v >' ° 

A simple algorithm to compute the dependency graph of u consists in 
scanning u from left to right and keeping for each letter an integer 

indicating its last occurrence in the left factor of u that has been j 
scanned . J 

•! : 

an example of a commutation monoid can be obtained as follows, Let 
/. 8 he two seta. The set M of mappings 

i 

m : i - fi* ; 
has a structure of a monoid with a product defined by ! 
mn(i) - m(t)n<i) 

It is isomorphic to the direct product of Card (I) copies of B* and 
therefore is a eonroutation monoid. 

When i = fl, this monoid is called in C3 3 the fiow monoid and its 
elements are called flows. 

2. NORMAL FORMS 

We present in this section two normal form theorems. The first one is 
due to D. Poata and the second one to D._ Knutn. _ 

we begin with Poata Normal Form, Let MCA, B) be a free partially 
commutative monoid. We suppose that the alphabet A is totally ordered. 

We say that a word w « A* is in Foata Normal Form if 

w = W;!/^. . .</„ (2.1) 

where the words ^i,U2 u n 3ati3f V following conditions : 

(i) each word (// is a non empty product of distinct letters 
commuting with each other in increasing order 
(ii) if 8 is a letter of f not appearing in u) 
then there is a letter 6 of (// such that 

fa,o; fi ©. 
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1 

I 



■ !i 



1 1 ! 



■ : . • | 



.1 

II 



I 

• V 4 



Roughly speaXing, a void ia in normal form When all its letters nave 
oeen pushed at the left as much as possible to form Modta of distinct 
letters. 

for instance, in the free commutative monoid over A - the 
word = Cao) (a> <a) ±s in normal form with blocks indicated fey the 
parenthesis . 

If w is in formal form, there ia a unique way of factorizing w in 
(2.1). in fact u 1 ia the largest laft factor of w composed with distinct 
letters that commute with each other. 

The following result appears in £9] (see also Ciol). 

theorem 2.1. for each word u e A* there Is a unique word v * A* in normal 
torrn equivalent to u. 

Proof s The proof consists In an algorithm computing the normal form which 
will be commented upon later on. 

Let r be the set of words w * A* which are in Poata tfonnai Form, 
that 19 satisfying conditions <i) and (li) above. We define a mapping from r 

x A into r denoted <u,a> I > v. a in the following way. Let u = 

uju 2 - . . </ n the decomposition of u as in Eq.(Z.l). Let / he the greatest 
integer f0 * > * nJ Such that the word Uj contains the letter a or a 
letter b such that Cfl, b) £ e> we use the rule that 
integer exists, we then define 



f - 0 if no such 



where v /tf is the result of inserting a in t/ l+ f to obtain a word with its 
letters In increasing order. When / = n, we just set v /+ j » a. 

It is obvious that, according to the definition, we have u.a - va. 
Also, it is easy to verify that u.a <a r. This proves the existence of a 
normal form for any word w e A w . Furthermore f let us extend the mapping 

(t/,*) i — — > u.a to a mapping (tr.O » > w v from r x A 9 - r by 

associativity. We have for each u « f and. <«>to) * 9 the equality 

v. a© = v.oa 

as readily verified from the definitions, therefore, for any words w f W « A* 
such that w ~ w' we have 1.w - 1 . W> Moreover, if w is in normal form, 
we have * . w ~ */. This proves the uniqueness of the normal form since if 
w, W are two words in normal form that are equivalent/ we have 



* = ; . w 



'■ i 
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The proof given above leads to a simple algorithm to compute the 

normal form of a word- The, normal form of a word *CM, wl21 win} Is 

computed as a sequence of blocks 311 1, sl2l sC*3 each block being a 

subset of the alphabet. At the same time, one keeps track for each letter a 
of the alphabet of a pointer plal indicating the index of " the - rightmost " 
block in which either s or a letter b with <e.b)*e occurs in the 
sequence of blocks . It is then straightforward to obtain the normal form of a 
word by scanning its letters from left to right. The only point to be made 
precise is the updating of the pointers p[bj after inserting a letter a 
on the right. This is simply obtained by the following : 

pta) ;= Pla] * 1 ; 

for each b such that (a,t>) * e and pCbJ < PCa3 *o plt>3 pCa] ; 

The resulting algorithm computes the normal form of a word of length 
n in time Otkn) where h is the number of pasrs (a.b) such that C«i&) 

it is interesting to note that there is also the possibility of 
computing the normal form of a word by another algorithm which operates from 
ngni lo leu . The algoninm is somenow sampler since it does not require *hp 
use of pointers as before. 

The idea is to use a stack for each letter of the alphabet. When 
processing a letter a in the rig^t-to-left scanning of w, the letter ie 
pushed on its stack and a marker is pushed on the stack of all the letters b 
such that (fl.b) * ©. The normal form is obtained by recursively -peeling" 
the tops off the stacks as in the following example. 

EX*HPt£ 2.1 Let A {a t b,c,d) and tot the complement B of & be given 
by the graph 



a b 




we consider the ! word 'w^^ form of the four 

stacks is * 



a 










b 






a 


« 




* 


* 


• 


* 


d 


* 


b 


w 




a 


» 


c 




* 


b 


* 


* 


a 


b 


c 


d 



The normal form is obtained as : 

(a) (b) (ad) (be) (a) <b) □ 
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■The algorithm can be implemented in the following way : we uae for each 
letter a in /V a stack «ta3 of boolean with the convention that "false" 
is the marker. The first part which consists in filling the stacks is the 
following : 



for /;=n down to 7 do ooff/ft 

a:=*tO ; push (true, stal) ; 
for ©sen b in A such that 
(feme. &lt>l) 



(a.t>) * e o*o pua/> 



The second part which computes the normal form as a word u 
initialized to 1 uses a set variable B initialised to the whole alphabet 
A. It looks like 

white 8 not empty do 

for a In B 40 

it empty CsCaP 6.'=B-a *wgto 

H fop fsCaU f^ert ©*oVn 

t/,=</a ; pop (aial) ; 

for b in Q and <a>b)£ & do pop <sZb)> 

and 
end ; 

The idea of this algorithm is related to Viennofs theory of "plleups" 
which gives an interesting geometrical model of partially commutative monoids 
C8l, The algorithm used several slacks. From the point of view of automata 
theory, the question could be raised wether it could be performed by a finite 
automaton. -The answer is easily seen to be negative* Consider indeed the 
mapping 



O : tf I — • 



-> v 



assigning to each word its unique equivalent word in normal form. For A = 
(a,b) with ab ■ ba we have 

or* ((so)*) - {« « A* I \u\ s - \u\ D ) 

This shows that 6 is not a rational function since the rigftt-hand 
side is not a rational language. 

There is, however, something related to finite automata in all this : 
the set of words in normal form is a rational language- This gives the 
following corollary* whose idea was suggested to me by Philippe Flajoiet. 

PROPOSITION _2_._a. For each n * 0. fat * a be tha number of equivalence 
classes mod. e represented by words of length n. The aeries La n 2* Is 
rational. ° 

EouiPLE z.2. tfe consider again the monoid of Example 2,1, The get of words in 
normal form is precisely the set of words having no factor equal to one of tha 
eight words 
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aad t acb, bbc f bda. ccb f cda r dcb t dda 

and that do not begin by cb or da. The computation of the generating 
series D*n* n M complicated and will be treated in Example 2.3. a 

•there is another normal Conn theorem for commutation monoi/Js . It 
consists in a characterisation of the smallest lexicographic element in each 
congruence clasa. The result is due to Anieinwv and xnuth ClJ. 

theorem 2.2. A word w c A* Is the minimal lexicographic element of Its class 
mod, B Iff me following condition Is satisfied : tor each factorization w = rbsat 
with r,$,t e A*, a.b <a A and a < b there exists a tartar c of bs such 
that (a>c) a. 

J»roof t Pirst suppose that w is minimal. Let w - rbsat with r, a,t e A* ■; 
and a,b c A, a < b. If we had baa » abs then also 

w m rabst 

but rabst < rbsat whence a contradiction. Hence there exists a letter c of 
a that does not commute with a. This proves that w satisfies the 
condition of the statement. 

Conversely, suppose that there is a word u - w with u < w. Let r 
be the longest common left factor of i/.w. Then u = r$u\ w = rbW with a 
< 0. Since \u\ a * \w\ A there is a factorisation of W in = sat with 
id I a = o. Then all the letters of s commute with a since, otherwise , we 
would not have u a v ( Therefore Mf does not satisfy the condition of the 
statement . D 

hi\ algorithm to compute the lexicographic normal form could use the 
dependency graph of section l. It consists in selecting recursively the 
smallest letter without predecessor and deleting the corresponding vertex. 

The set of words in lexicographic normal form is again a rational 
language as illustrated in the following example. 

example 2.3. ( with the bor w>noid as in Example 2,. 1# tJje. l jaet. - of .woxde jln. ...... . 

iexicegrraphic normal form is the" set of words having no factor equal to co 
or da. This gives the generating series of the numbers of classes mod. 

a of words of length n 

1 

« = — 

1 - 4 S + 2 Z 2 

Indeed, let X denote the set of words in normal form. We have 

i 

X(1 * cb -fr da? - XA + 1 
whence the above equality. ^ a 
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3 , STRPCTORg OP COWUTATION MONO EPS 

•There a family o£ partially ranmutativa free monoids which ia well- 
known and simple enough. It is the family of i«>noida W^cA di^ct 
product of free monoids, the following result, due to M. clerbout ana K. 
tatteux C41. snova that thle family of monoids gives, in a senae, the genetic 



„ 3.1. Any finitely generated commutation monoid can be embedded into 
a finitely generated monoid which Is a direct product of free monoids. 

Proof : Let M = M<A.e) with A finite. Let A c A be the aet of letters 
Which commute with an other lettere in A. 

A' - (a e A i a A, (a>b) « ©) 

let A' = i*i,a 2 a*) »r«S A" = A - A'. The monoid W la 

isomorphic with the direct product 

M<A',e) * A m f x a*^ x. . .xa% 

The proof is therefore only needed for a monoid M<A t 9) such that 
A' = a. Let 0 = A x A and let a : A* - 8* be the morphia* defined for 
a <s A by 



a(a> = <e,ajHa,a2> 



fa.eV 



where {sj,a 2 **) in s * anie order, the aet of letters b such that 

(s.b) t Since A' = 0, the morphiem o is injectlve, we define a 
relation t on the alphabet 8 by 

r fixe - {(<a.b). <b,a>) i a, b * A] 

The monoid M(B,r) is clearly isomorphic to the direct product of 
the free monoida with two generator* [t*.b) % (b,a))* for a,b a A. 

TO prove that the moronism « induces a isomorphism from M into 
M(& t r) we need to prove that for u.v.a A*— 

ii s v moo*. © <=> of(i; = afv) /noo\ t 

The direct implication followa directly from the fact that ab = o« 
incites otfacJ « a(ba; for a, 6 e A. For the converse, we use Proposition 
1.1. Let u,v a A w be auch that <*<u) a(v> mod- t- Let a e A be a 
letter, since A' ~ 0 by the diseuasion above, there ia a letter b « A 
such that <a,b> a* ©, Then 

lul e * i«fti>i ffltW = »«<*>l<a,DJ = lv, b 

Therefore, condition Ci) is satisfied. Let now. a,b a A be such that 
<a,b) * e, *hen clearly * a b (u) =. ff^o**' since the projections of *<u> , 
afW on i<a,b), <b,6>)* are equal.' Hence, by Proposition 1.1, we have u = 
v mod. e. a 



— I 
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The -embedding theorem" ha* several consequences on the structure of 
commutation monoids. Among -them, there is tbe following result Which was 
proved by R* Corl and T. Metiviex tsi. 

theorem 3.1. For dflV word u c A* f/js sef 

C(i/> = (n * A* I uv s w mod, © ) 

a rational language. 

Proof ; fcet a be a morphiem from A* into a direct product A m 1 k. . .xaV 
Since, In a free monoid, the commutator of an element la a cyclic submonoid, 
the commutator of any element of A*jx. . ,xA* n is rational. The set Ciu) is 
the inverse Image by a of a rational subset and is therefore rational. o 

Another results of (53 also proved independantly by C. Duboc C7J is 
that the linage of C(u> in the monoid M(A,e> is a finitely generated 
submonoid. The proof is based on the fact that if the graph of the complement 
of e is connected, the set Gtu? is a cyclic submonoid. 

4, frECOCNIZftBILITY IW COMMOTATIOM MONOIDS 

We will be interested in this section in the following problem i how 
can one describe the recognizable subsets of a commutation monoid ? Becall 
that, given a monoid M, a subset X of M is said to be recognizable if 
there exists a morphism 0 from M into a finite monoid F which saturates 
X, i.e. such that *""*$(X) * X. The family of recognisable subsets of M is 
denoted Roc(M) . When M is a free monoid, the recognizable sets coincide 
with the rational sets, i.e. the sets described by rational expressions. In 
the general case, a recognisable set is always rational but the converse is 
not true. 

In a monoid which is a direct product of free monoids, the family of 
recognizable sets has a simple structure : it consists of finite unions of 
direct products of recognisable subsets of each component. This comes from the 
well-Known result asserting that the recognisable subsets of a direct product 
of two monoidB are finite unions of recognizable subsets of the components 
.(.see.,.C2l.e.g.O - ■ — - - - - ■• ; " 

The situation is more complicated in general commutation monoids. Me 
shall concentrate here on the closure properties of recognizable sets under 
rational operations : union, product, star. 

Hhe closure of the family of recognizable sets under union holds, in 
any monoid . For products, we have the following result Cs] : 

THEOREM e.i. The product XY of two recognizable subsets X.Y of a 
commutation monoid Is recognizable* 

Proof i We use the "embedding theorem" 3.1. According to this theorem, it is 
enough to prove the closure under product ln\the case of a direct product of 
free monoids, indeed if M, N axe two monoids with M c N a set X c M 
which is recognisable in N is also recognisable in M. Let M be the 
family of all monoids in which the family of recognizable sets is closed under 
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product. If M and /V belong to 
fact, if X f y are recognisable in 
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M then also MxN 
MxW we have 



belongs to M. In 
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with ft/, 7) « ftect/W and S /P V/ e Rec<N> . Then 
i.J 

and therefore XV e Reo(MxN) . since any free monoid belongs to the family 
M, any direct product of f*ee monoids axso belong© to H. a 

The closure under the star operation presents a difficulty. In fact, 
it is not true In general that the star X* of a recognisable subset X of 
a commutation ronoid is recognisable. It is enough to consider the case of X 
= [<U1>) in the monoid NxN since 

X* = {(n,n> I n > 0} 

Which is the diagonal of NxN and which is not recognisable. 

The following theorem is the best result Xnown at present cm the 
closure under the star operation, m present it without proof. 



theorem «.2. Let M = M(A,&) oe a commutation monoid. Let X fie s 
recognizable subset of M satisfying the following condition : for each word 
<s A* representing an element of x, the restriction of the complement of 
the set of letters appearing In x is connected. Then the set X is 
recognizaoie in M. 

This result is due to *. Ketivier 112). it has been obtained by 
successive generalisations of previous ones. First of all, the result was 
proved byM.F. Pie and <#. Roucairol [91 under the stronger hypothesis that 



x 

e ro 



in X there is only one word in A* 



<'i> X is finite • . 
(ii) for each element 
representing X . 

in a further work, done by R. Cori and myself t&J, we were able to 
replace hypothesis (i) by the lesH restrictive hypothesis that X is 
recognizable. The result was also proved lndependantly by M. Latteux. Finally, 
y. Ketivier has succeeded to adapt our proof to obtain Theorem 4.2 in which 
condition (ii) is stated in a much weaker torn. 



EXAMPLE 4.1. fcet A - {a.o.c.C) 
conpletnent by the figure below 



and O be given by the graph of its 
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The commutation monoid M<A,e> is the same as in Example 2.1, 
consider the sets 

v a "C/"« a i gjf<x , y i? x) 

By Theorem 4.2, the set Y is recognisable . It is in fact recognised 
by the finite automaton given be Tow 
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